Abstract: In this paper we propose a pair of low-complexity
user selection schemes with zero-forcing precoding for multiuser
massive MIMO downlink systems, in which the base station
is equipped with a large-scale antenna array. First, we derive
approximations of the ergodic sum rates of the systems invoking
the conventional random user selection (RUS) and the
location-dependent user selection (LUS). Then, the optimal number of
simultaneously served user equipments (UEs), K*, is investigated
to maximize the sum rate approximations. Upon exploiting K*,
we develop two user selection schemes, namely K*-RUS and
K*-LUS, where K* UEs are selected either randomly or based on
their locations. Both of the proposed schemes are independent
of the instantaneous channel state information of small-scale
fading, and therefore enjoy the same extremely low computational
complexity as the conventional RUS scheme. Moreover,
both of our proposed schemes achieve significant sum rate
improvement over the conventional RUS. In addition, it is worth
noting that, like the conventional RUS, K*-RUS achieves good
fairness among UEs.

Index Terms —User selection, massive MIMO, low-complexity, 
system sum rate, user fairness. 


I. Introduction 

The multiuser MIMO (MU-MIMO) technology plays a
key role in modern wireless communications due to its substantial
performance gains over the conventional single-input
single-output (SISO) techniques [?], [?]. Relying on MU-MIMO,
a multi-antenna base station (BS) can simultaneously
serve multiple user equipments (UEs) within a cell using the
same spectrum resource, and thus the spectral efficiency is
improved. User selection is critical for optimizing MIMO
systems' overall performance in a variety of scenarios and has
been extensively studied, such as in cellular networks (see for
example [?] and references therein) and in multi-hop networks
[?]-[?]. The semi-orthogonal user selection (SUS) leveraging
the degree of channel orthogonality among UEs is probably
one of the most popular low-complexity user selection methods
for improving system sum rates [?], [?], [?]. Additionally,
considering the fairness amongst UEs, round robin scheduling
[?] and random user selection (RUS) are regarded as the
two simplest methods offering equal opportunities to all the
candidate-UEs. Both of them have been widely employed in
practical cellular systems as well [?].

In the realm of MU-MIMO, the recently proposed massive
MIMO, where the BS is equipped with a large-scale antenna
array to serve multiple UEs, has been widely envisaged as one
of the major candidate technologies for the fifth generation
(5G) cellular networks owing to its favorable features, such
as huge spectral efficiency and energy efficiency gains [10]-[?],
[?], [?]. As in the conventional small-scale MU-MIMO
systems, user selection is also important in massive
MU-MIMO systems [16]-[19], though it faces new challenges.
More specifically, in the user selection for conventional MU-MIMO
systems, it is usually assumed that the number of
candidate-UEs, N, is much larger than that of the BS antennas,
M. Therefore, upon employing instantaneous channel state
information (CSI)-aided user selection methods (e.g., SUS),
multiuser diversity gains can be harvested to boost the overall
system performance. By contrast, in massive MIMO systems,
it is impractical to have N ≫ M, since M is already
very large. Moreover, the computational complexity of the
conventional user selection methods might be too high for
massive MIMO systems. For example, the computational
complexity of SUS is roughly O(M^3 N) [?], which will cause
huge consumption of power and computational resources if M
becomes large.

Recently, a range of user selection schemes have been proposed
for massive MIMO systems. The time-division duplex
(TDD) and frequency-division duplex (FDD) based massive
MIMO systems impose different requirements on user selection.
By exploiting the instantaneous CSI of candidate-UEs,
Lee et al. proposed an SUS-like user selection method in [18]
and Xu et al. developed a greedy user selection scheme in [?].
These selection methods mainly focus on FDD scenarios, in
which the amount of downlink transmission resources consumed
by the downlink channel estimation training for all the
candidate-UEs does not increase with the number of
candidate-UEs N [?]. By contrast, in TDD scenarios, the downlink
channel is estimated through uplink training relying on channel
reciprocity, and the pilot/training symbol overhead imposed
by channel estimation increases with N [21]. In this scenario,
if the number of candidate-UEs is large, most of the channel
coherence slot in the time domain will be consumed by channel
estimation, leaving only a small fraction for downlink data
transmission. Hence, besides the high computational complexity,
the pilot overhead needed for channel estimation of
candidate-UEs also limits the application of instantaneous CSI-aided
user selection methods in TDD scenarios. In addition, for
FDD based massive MIMO systems, Nam et al. introduced
user selection methods based on the candidate-UEs' feedback
of instantaneous signal-to-interference-plus-noise ratio (SINR)
in [?]. However, these methods are not applicable to TDD
systems either. This is because in TDD scenarios, the feedback
signals from a large number of candidate-UEs will increase
the uplink proportion of the uplink-downlink shared frequency
band at each coherence slot, resulting in reduced resources
left for downlink data transmission. In summary, neither the
instantaneous-CSI estimation based nor the uplink-feedback
based user selection methods are suitable for the downlink of
TDD-based massive MIMO systems. At the time of writing,
the design of user selection for massive MIMO systems with
the TDD mode remains a largely open area. Hence, novel
user selection methods that cause little or no reduction
of downlink transmission resources represent a promising
new research subject.

In this paper, we consider the downlink of a TDD based
massive MIMO system where pilot-based channel estimation
and zero-forcing (ZF) precoding are invoked for serving a
number of UEs. First, with the aid of the random matrix
theory (RMT)-based large system analysis, we derive
approximations of the ergodic sum rates of the systems invoking the
conventional RUS and the location-dependent user selection
(LUS). The optimal number of simultaneously served UEs,
denoted as K*, is solved offline for maximizing the sum
rate approximations. Then, aiming to improve the system
sum rates, a pair of K*-based low-complexity user selection
methods are proposed, namely the K*-based random user
selection (K*-RUS) and the K*-based location-dependent
user selection (K*-LUS). For K*-RUS, K* UEs are randomly
selected for simultaneous data transmissions at each time
slot. The system sum rates are improved with an appropriate
configuration of K*. Meanwhile, the fairness among UEs is
guaranteed as a result of the random selection. For the
K*-LUS scheme, the K* UEs nearest to the BS are selected for data
transmission, which may achieve higher sum rate performance
than K*-RUS.

Notably, our schemes exhibit two fundamental differences
as compared with the conventional user selection schemes.
First, unlike the conventional SUS that requires the instantaneous
CSI of small-scale fading (SSF), our schemes only
need long-term CSI. Second, rather than emphasizing which
UEs should be selected for improving system performance,
the proposed user selection schemes mainly focus on how
many UEs should be selected for simultaneous transmissions.
Thanks to these differences, we bypass the complicated online
computations regarding the sum rates and the selection metric,
which are often inevitable in the conventional schemes. Therefore,
the online computational complexity of the two proposed
schemes is on the same order as that of the conventional RUS
scheme. Furthermore, since our user selection schemes are
independent of the SSF CSI of candidate-UEs, we no longer
have to carry out channel estimation of all the candidate-UEs
for user selection. Instead, only the active-UEs need to send
pilots at each coherence slot. As a beneficial result, we are
capable of saving the cost of channel training significantly
and attaining more resources for data transmission.

It is worth pointing out that a location-adaptive transmission
strategy was proposed for TDD based massive MIMO systems
in [22]. Our work differs from [22] in several respects.
Random locations of UEs are assumed in this paper, whereas
in [22] the UEs were assumed to be placed at fixed points.
Therefore, the optimal number of active-UEs in our paper is
independent of specific channel realizations, while in [22] this
number has to be re-calculated whenever any candidate-UE's
large-scale-fading (LSF) CSI changes. Moreover, we consider
spatial correlation in the channel model, which is ignored in
[22].

Fig. 1. The downlink of a TDD based massive MIMO system, which is
composed of an M-antenna BS and N single-antenna candidate-UEs. Among
all the candidate-UEs, K UEs are selected to be simultaneously served, which
are regarded as the active-UEs.

The remainder of this paper is organized as follows. The 
system model is presented in Section II. In Section III, we 
analyze the asymptotic sum rates of the system invoking the 
conventional RUS and LUS, and then further develop two low- 
complexity user selection schemes. Our numerical results are 
provided in Section IV. Finally, the conclusions are drawn in 
Section V. 

Notations: We use uppercase and lowercase boldface letters
to denote matrices and vectors, respectively. $(\cdot)^H$, $(\cdot)^\dagger$ and
$\mathrm{tr}(\cdot)$ denote the conjugate transpose, the pseudo-inverse, and
the trace operations, respectively. $\mathbb{E}_x[\cdot]$ represents the expected
value with respect to x. $\mathcal{CN}(\mathbf{m}, \boldsymbol{\Theta})$ denotes the circularly
symmetric complex Gaussian distribution with mean vector
$\mathbf{m}$ and covariance matrix $\boldsymbol{\Theta}$. Finally, $\xrightarrow{\text{a.s.}}$ denotes the almost
sure convergence.

II. System Model 

We consider the downlink of a TDD based massive MIMO
system consisting of an M-antenna BS and N single-antenna
candidate-UEs (N > M). We assume that N and M are of the
same order. K UEs (K < M) are selected for simultaneous
data transmissions at each coherence slot. The composite
channel matrix $\mathbf{G} \in \mathbb{C}^{K \times M}$ from the BS to the K active-UEs
characterizes LSF, SSF and transmit correlation¹, and can be
expressed as

$$\mathbf{G} = \mathbf{D}^{1/2}\mathbf{H}\mathbf{R}^{1/2}, \tag{1}$$

where $\mathbf{H} \in \mathbb{C}^{K \times M}$ is the SSF matrix with independent
and identically distributed (i.i.d.) $\mathcal{CN}(0, 1)$ entries, and the
diagonal matrix $\mathbf{D} \in \mathbb{R}^{K \times K}$ contains the LSF coefficients $\beta_k$
along its main diagonal. We model the LSF of the k-th active-UE
as² $\beta_k = c\,d_k^{-\alpha}$, k = 1, ..., K, in which $d_k$ is the distance
from the k-th active-UE to the BS, $\alpha$ is the pathloss exponent
and c is the pathloss at the reference distance. The transmit
correlation matrix at the BS is modeled by the widely used
$\delta$-Kac-Murdock-Szegő matrix $\mathbf{R}$, in which $\delta$ is the antenna
correlation coefficient [?].

1 The distance between UEs is supposed to be sufficiently large compared
to the signal wavelength, so the receive correlation is not taken into account.
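As a rough illustration of the two deterministic ingredients of this channel model, the sketch below builds a Kac-Murdock-Szegő correlation matrix and evaluates the pathloss-only LSF coefficient. It assumes a real-valued correlation coefficient, so that entry (i, j) of R equals δ^|i-j|; the function names `kms_matrix` and `lsf_coefficient` are hypothetical and are not taken from the paper.

```python
def kms_matrix(m, delta):
    """Real-valued Kac-Murdock-Szego matrix (assumed form): R[i][j] = delta**|i-j|."""
    return [[delta ** abs(i - j) for j in range(m)] for i in range(m)]


def lsf_coefficient(distance, c, alpha):
    """Pathloss-only large-scale fading coefficient beta_k = c * d_k**(-alpha)."""
    return c * distance ** (-alpha)


# Illustrative values only: 4 antennas, correlation coefficient 0.5.
R = kms_matrix(4, 0.5)
# With delta = 0 the matrix reduces to the identity (no transmit correlation).
R_uncorrelated = kms_matrix(3, 0.0)
# A UE close to the BS sees a larger LSF coefficient than a cell-edge UE.
beta_near = lsf_coefficient(35.0, 10 ** -3.53, 3.76)
beta_far = lsf_coefficient(250.0, 10 ** -3.53, 3.76)
```

The composite channel G would then be obtained by scaling the rows of an i.i.d. complex Gaussian matrix by the square roots of the LSF coefficients and right-multiplying by a square root of R.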

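The system operates in the TDD mode and the BS obtains
the SSF CSI relying on training-based channel estimation.
The estimate $\hat{\mathbf{H}}$ of the SSF CSI matrix $\mathbf{H}$ is modeled as

$$\mathbf{H} = \hat{\mathbf{H}} + \tilde{\mathbf{H}}, \tag{2}$$

where the K x M dimensional estimated channel matrix
$\hat{\mathbf{H}} = [\hat{\mathbf{h}}_1^T, \cdots, \hat{\mathbf{h}}_K^T]^T$ and the K x M dimensional error matrix
$\tilde{\mathbf{H}} = [\tilde{\mathbf{h}}_1^T, \cdots, \tilde{\mathbf{h}}_K^T]^T$ can be expressed as

$$\hat{\mathbf{H}} = \sqrt{1-p}\,\mathbf{Z}_1, \quad \tilde{\mathbf{H}} = \sqrt{p}\,\mathbf{Z}_2, \quad 0 \le p \le 1. \tag{3}$$

$\hat{\mathbf{h}}_k \in \mathbb{C}^{1 \times M}$ and $\tilde{\mathbf{h}}_k \in \mathbb{C}^{1 \times M}$ are the k-th rows of $\hat{\mathbf{H}}$ and
$\tilde{\mathbf{H}}$, respectively. Both $\mathbf{Z}_1 \in \mathbb{C}^{K \times M}$ and $\mathbf{Z}_2 \in \mathbb{C}^{K \times M}$ are
composed of i.i.d. $\mathcal{CN}(0, 1)$ entries, and the two matrices are
independent of each other³. We assume that the LSF CSI of
each UE and the transmit correlation matrix $\mathbf{R}$ are perfectly
known at the BS. According to (2) and (3), the downlink
channel matrix defined in (1) can be rewritten as⁴

$$\mathbf{G} = \hat{\mathbf{G}} + \tilde{\mathbf{G}} = \mathbf{D}^{1/2}\hat{\mathbf{H}}\mathbf{R}^{1/2} + \mathbf{D}^{1/2}\tilde{\mathbf{H}}\mathbf{R}^{1/2}. \tag{4}$$

Upon invoking the estimated channel matrix $\hat{\mathbf{G}}$ and the ZF
precoding, the transmitted vector $\mathbf{x} \in \mathbb{C}^{M \times 1}$ is written as

$$\mathbf{x} = \gamma\hat{\mathbf{G}}^{\dagger}\mathbf{s} = \gamma\hat{\mathbf{G}}^{H}(\hat{\mathbf{G}}\hat{\mathbf{G}}^{H})^{-1}\mathbf{s}, \tag{5}$$

where $\gamma$ is the power controlling factor, $\mathbf{s} = [s_1, \cdots, s_K]^T$ is
the K x 1 information-bearing symbol vector and $s_k$ represents
the symbol intended for the k-th active-UE. Denoting the
available total transmit power at the BS as P, the long-term
power constraint is given by

$$\mathbb{E}[\mathrm{tr}(\mathbf{x}\mathbf{x}^{H})] \le P.$$

Thus, $\gamma$ can be calculated as

$$\gamma = \sqrt{\frac{P}{\mathrm{tr}\left[(\hat{\mathbf{G}}\hat{\mathbf{G}}^{H})^{-1}\right]}}. \tag{6}$$

It should be noted that we employ equal power allocation
among active-UEs for the sake of low computational complexity.
Relying on (5), the K x 1 dimensional received signal
vector at the K active-UEs is expressed as

$$\mathbf{y} = \mathbf{G}\mathbf{x} + \mathbf{n} = \gamma(\hat{\mathbf{G}} + \tilde{\mathbf{G}})\hat{\mathbf{G}}^{\dagger}\mathbf{s} + \mathbf{n} = \gamma\mathbf{s} + \gamma\tilde{\mathbf{G}}\hat{\mathbf{G}}^{\dagger}\mathbf{s} + \mathbf{n}, \tag{7}$$

where $\mathbf{n} = [n_1, \cdots, n_K]^T \in \mathbb{C}^{K \times 1}$ is the additive white Gaussian
noise (AWGN) vector at the UEs, and $n_k \sim \mathcal{CN}(0, \sigma_n^2)$
represents the noise at the k-th active-UE. Additionally, the
signal received at the k-th active-UE is given by

$$y_k = \gamma s_k + \gamma\sqrt{c\,d_k^{-\alpha}}\,\tilde{\mathbf{h}}_k\mathbf{R}^{1/2}\hat{\mathbf{G}}^{\dagger}\mathbf{s} + n_k. \tag{8}$$

Then, we can write the SINR recorded at the k-th active-UE
as

$$\mathrm{SINR}_k = \frac{\gamma^2 |s_k|^2}{\sigma_n^2 + c\,d_k^{-\alpha}\gamma^2\,\tilde{\mathbf{h}}_k\mathbf{R}^{1/2}\hat{\mathbf{G}}^{\dagger}\mathbf{s}\mathbf{s}^{H}(\hat{\mathbf{G}}^{\dagger})^{H}(\mathbf{R}^{1/2})^{H}\tilde{\mathbf{h}}_k^{H}}. \tag{9}$$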


Then, the sum rate $\mathcal{R}$ is given by

$$\mathcal{R} = \left(1-\frac{K}{T}\right)\sum_{k=1}^{K}\mathcal{R}_k = \left(1-\frac{K}{T}\right)\sum_{k=1}^{K}\log_2(1+\mathrm{SINR}_k), \tag{10}$$

where $\mathcal{R}_k$ is the rate of the k-th active-UE, and T denotes
the number of symbols over which the channel is constant.
As a percentage, the pre-log factor (1 - K/T) implies that
the downlink data transmission only occupies a fraction of
the coherence slot. In particular, we assume that K UEs are
simultaneously served. Since each of the K UEs is assigned
one of the K orthogonal pilot sequences, the length of the
pilot sequence should not be shorter than K symbols [11].
For simplicity, in this paper we adopt the shortest available
pilot sequence of K-symbol length. Because the BS does not
transmit data during the uplink pilot transmission for channel
estimation, there exist (T - K) symbols left for downlink data
transmission at each coherence slot consisting of T symbols.
Therefore, the sum rate for downlink data transmission may
be evaluated using (10). Note that in order to guarantee the
feasibility of data transmission, we assume K < T in this
paper. Otherwise, the pilot transmission would occupy the
entire coherence slot T.
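The effect of the pre-log factor in (10) can be made concrete with a few lines of code. The following is a minimal sketch, assuming the per-UE SINRs are already available as linear (not dB) values; the function name `sum_rate` is hypothetical.

```python
import math


def sum_rate(sinrs, T):
    """Sum rate (10): (1 - K/T) * sum_k log2(1 + SINR_k), with K = len(sinrs)."""
    K = len(sinrs)
    if K >= T:
        # The paper assumes K < T; otherwise pilots fill the whole coherence slot.
        raise ValueError("K must be smaller than T")
    prelog = 1 - K / T
    return prelog * sum(math.log2(1 + s) for s in sinrs)


# Four UEs at 0 dB SINR each carry 1 bit/s/Hz; with T = 8 half of the slot
# is spent on pilots, so the sum rate is 0.5 * 4 = 2 bits/s/Hz.
example = sum_rate([1.0, 1.0, 1.0, 1.0], T=8)
```

Serving more UEs adds terms to the sum but shrinks the pre-log factor, which is the trade-off that motivates searching for an optimal K.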


2 In this paper, we use a simplified LSF model in which the shadow fading
is excluded. Nevertheless, it should be noted that algorithms and schemes
developed in this paper can be directly extended to the model including
shadow fading.

3 Note that the imperfect CSI model invoked here is similar to that of [?]
and different from that employed by [24] and [25]. In [24] and [25], the
authors assumed $\hat{\mathbf{H}} = \mathbf{H} + \tilde{\mathbf{H}}$, where $\mathbf{H}$ and $\tilde{\mathbf{H}}$ are mutually independent.

4 In fact, as shown in [?], the explicit forms of $\hat{\mathbf{G}}$ and $\tilde{\mathbf{G}}$ are related to
the channel estimation approach, the power and the length of the training
pilots, as well as the statistical information of the estimated channel. Here, we
adopt a simplified model for tractability. Some tailored user selection schemes
relying on a specific channel estimation algorithm (e.g., LS, MMSE) are
left for our future work.


III. Asymptotic Sum Rate Analyses-based 
Low-Complexity User Selection 

In this section, the proposed user selection schemes are
presented. First, relying on the RMT-based large system analysis,
we derive a deterministic approximation of the ergodic
sum rate of the ZF precoder aided massive MIMO system.
This result brings new insights into the question of how to
enhance the system sum rate performance. Then, a pair of
low-complexity user selection schemes are proposed based on
the attained approximation of the sum rate.

A. Sum Rate Approximation in the Large-System Regime

We first evaluate the value of $\mathrm{SINR}_k$ for the scenario where
M and K go to infinity with a finite ratio M/K > 0. Then,
the approximation of the ergodic sum rate is derived.

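Applying RMT-based large-system analysis, we reveal that
$\mathrm{SINR}_k$ may be characterized by (please see the Appendix for the
detailed derivation)

$$\mathrm{SINR}_k \xrightarrow{\text{a.s.}} \frac{1}{\sum_{i=1}^{K} d_i^{\alpha}\left(A(K,M) + B(K,M)\,d_k^{-\alpha}\right)}, \tag{11}$$

where A(K, M) and B(K, M) are defined in (12). [The expressions
in (12) are not recoverable from the source text.] In (12), $\phi$ is the
unique solution of the equation

$$\phi = \frac{1}{M}\mathrm{tr}\left(\mathbf{R}\left(\mathbf{I}_M + \frac{K}{M}\frac{1}{\phi}\mathbf{R}\right)^{-1}\right)$$

and $\psi$ is defined as

$$\psi = \frac{1}{M}\mathrm{tr}\left(\mathbf{R}\left(\mathbf{I}_M + \frac{K}{M}\frac{1}{\phi}\mathbf{R}\right)^{-2}\right).$$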

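Exploiting (10) and (11), the ergodic sum rate $\mathbb{E}[\mathcal{R}]$ in the
large-system regime can be formulated as

$$\mathbb{E}[\mathcal{R}] = \left(1-\frac{K}{T}\right)\mathbb{E}_{d_1,\ldots,d_K}\left[\sum_{k=1}^{K}\log_2(1+\mathrm{SINR}_k)\right] \ge \bar{\mathcal{R}},$$

in which $\bar{\mathcal{R}}$ is defined as

$$\bar{\mathcal{R}} = \left(1-\frac{K}{T}\right)\sum_{k=1}^{K}\mathbb{E}_{d_k}\left[\log_2\left(1+\overline{\mathrm{SINR}}_k\right)\right] \tag{13}$$

and "$\ge$" is obtained by applying Jensen's inequality

$$\mathbb{E}\left[\log_2\left(1+\frac{1}{x}\right)\right] \ge \log_2\left(1+\frac{1}{\mathbb{E}[x]}\right).$$

Additionally, $\overline{\mathrm{SINR}}_k$ is given by

$$\overline{\mathrm{SINR}}_k = \frac{1}{T_1 d_k^{\alpha} + T_2 d_k^{-\alpha} + T_3}, \tag{14}$$

where

$$T_1 = A(K,M), \quad T_2 = B(K,M)\sum_{i=1,i\neq k}^{K}\mathbb{E}_{d_i}[d_i^{\alpha}], \quad T_3 = A(K,M)\sum_{i=1,i\neq k}^{K}\mathbb{E}_{d_i}[d_i^{\alpha}] + B(K,M).$$

In this paper, we employ the lower bound $\bar{\mathcal{R}}$ defined by
(13) as an approximation of the system sum rate. It should
be emphasized that $\bar{\mathcal{R}}$ is independent of the SSF CSI of UEs,
which constitutes the basis for our designs.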


Remark 1. The results obtained above are invalid for the
scenario of K = M due to mathematical intractability.
Fortunately, as shown by our simulation results that are given
in Section IV-A, K = M is rarely beneficial for enhancing the
performance of the considered massive MIMO system. This
phenomenon was also observed by [16] and [24]. Therefore,
in this section we focus on the sum rate approximation for
K < M.

B. K*-Based Random User Selection (K*-RUS) 

In this subsection, we develop a novel RUS scheme, namely
K*-RUS, for the sake of improving the system sum rate
and ensuring the fairness among candidate-UEs. Specifically,
compared with the conventional RUS scheme in which M
UEs are selected for simultaneous data transmissions at each
coherence slot, we modify the number of active-UEs to a
more appropriate value K*, which is decided according to the
system parameters (e.g., the transmit power P of the BS) and
the statistical information of the channel (e.g., the probability
distributions of SSF CSI and LSF CSI).



Fig. 2. The diagram of UE locations. 


In order to obtain $K^*_{\mathrm{RUS}}$, we need to consider the user
distribution, which facilitates characterizing the effects of LSF.
As shown in Fig. 2, in this paper we employ the common
circular cell model [?], where all the N candidate-UEs are
independently uniformly distributed (i.u.d.) in a circular cell
having an inner radius of $R_{\min}$ and an outer radius of $R_{\max}$,
while the BS is located at the center point. We denote the
locations in polar coordinates, i.e., $(r_k, \theta_k)$ is the location of
the k-th candidate-UE and (0, 0) that of the BS. Therefore, the
probability density functions (PDFs) of $r_k$ and $\theta_k$ are given
by (the subscript k is omitted below for ease of notation)

$$f_r(r) = \frac{2r}{R_{\max}^2 - R_{\min}^2}, \quad R_{\min} \le r \le R_{\max}, \tag{15}$$

and

$$f_{\theta}(\theta) = \frac{1}{2\pi}, \quad 0 \le \theta < 2\pi,$$

respectively. The cumulative distribution functions (CDFs) of
$r_k$ and $\theta_k$ are written as

$$F_r(r) = \frac{r^2 - R_{\min}^2}{R_{\max}^2 - R_{\min}^2}, \quad R_{\min} \le r \le R_{\max}, \tag{16}$$

and

$$F_{\theta}(\theta) = \frac{\theta}{2\pi},$$

respectively.

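As a result, for K*-RUS, the PDF of the distance from the
k-th active-UE to the BS, i.e., $f_{d_k}^{\mathrm{RUS}}(r)$, is given by

$$f_{d_k}^{\mathrm{RUS}}(r) = f_r(r), \quad R_{\min} \le r \le R_{\max}, \quad k = 1, \ldots, K.$$

Then, it is easy to obtain

$$\sum_{i=1,i\neq k}^{K}\mathbb{E}_{d_i}[d_i^{\alpha}] = (K-1)\int_{R_{\min}}^{R_{\max}} r^{\alpha} f_r(r)\,dr = \frac{2(K-1)\left(R_{\max}^{\alpha+2} - R_{\min}^{\alpha+2}\right)}{(\alpha+2)\left(R_{\max}^{2} - R_{\min}^{2}\right)}. \tag{17}$$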
Thus, $\bar{\mathcal{R}}$ in (13) can be rewritten as

$$\bar{\mathcal{R}}_{\mathrm{RUS}} = \left(1-\frac{K}{T}\right)K\int_{R_{\min}}^{R_{\max}}\log_2\left(1+\frac{1}{T_1 r^{\alpha} + T_2 r^{-\alpha} + T_3}\right)f_r(r)\,dr. \tag{18}$$

Substituting (12) and (17) into (14), and then substituting
(14) and (15) into (18), $\bar{\mathcal{R}}_{\mathrm{RUS}}$ can be reformulated
as (19), which is given on the next page. We can see
that (19) is only related to the system parameters
T, M, K, $R_{\min}$, $R_{\max}$, P, c, $\sigma_n^2$, p, $\alpha$ and $\delta$.

Given these system parameters, the optimal number $K^*_{\mathrm{RUS}}$
in the sense of maximizing $\bar{\mathcal{R}}_{\mathrm{RUS}}$ can be obtained efficiently
with a one-dimensional search over the candidate set
{1, 2, ..., M-1}, i.e.,

$$K^*_{\mathrm{RUS}} = \arg\max_{K \in \{1,\ldots,M-1\}} \Gamma_1(T, M, K, R_{\min}, R_{\max}, P, c, \sigma_n^2, p, \alpha, \delta). \tag{20}$$

Obviously, $\bar{\mathcal{R}}_{\mathrm{RUS}}$ is independent of any instantaneous CSI,
which makes it possible to find $K^*_{\mathrm{RUS}}$ offline.

After obtaining $K^*_{\mathrm{RUS}}$, the only online operation in the
proposed K*-RUS scheme is to randomly select $K^*_{\mathrm{RUS}}$ UEs
for simultaneous data transmissions. Therefore, little extra
computational complexity is imposed on the proposed
K*-RUS compared to the conventional RUS.

Remark 2. The authors of [24], [28] and [29] also discussed
the optimal number of UEs for the ZF precoding, but they
treated the LSF coefficients of UEs as deterministic values,
which limits the generality of K*. In other words, K* has to
be updated whenever any LSF CSI of the system changes.
By contrast, we take random UE locations into account for
obtaining a more general and practical K*. As long as the
statistical properties of the system remain unchanged, our K*
keeps its current value for arbitrary LSF and SSF channel
realizations.


C. K*-Based Location-Dependent User Selection (K*-LUS)

In order to enhance the sum rate performance further, a
K*-based location-dependent user selection scheme, namely
K*-LUS, is developed in this subsection. In K*-LUS, we
select an appropriate number of UEs in descending order of
the LSF coefficients (i.e., in ascending order of the BS-UE
distances) for simultaneous data transmissions.

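Inspired by K*-RUS, let us first investigate the optimal
number of active-UEs in LUS, which is denoted by $K^*_{\mathrm{LUS}}$. We
assume that K UEs are selected and the distances between
the selected UEs and the BS satisfy $d_1 \le d_2 \le \cdots \le d_K$. Let
$(r_k, \theta_k)$ represent the location of the k-th active-UE according
to the distance in ascending order. According to (16), we can get
the PDF of the order statistic $d_k$, k = 1, ..., K, in LUS as

$$f_{d_k}^{\mathrm{LUS}}(r_k) = \frac{F_r^{k-1}(r_k)\left[1 - F_r(r_k)\right]^{N-k} f_r(r_k)}{B(k, N-k+1)} = \frac{2 r_k \left(r_k^2 - R_{\min}^2\right)^{k-1}\left(R_{\max}^2 - r_k^2\right)^{N-k}}{B(k, N-k+1)\left(R_{\max}^2 - R_{\min}^2\right)^{N}}, \tag{21}$$

in which B(x, y) represents the Beta function with parameters
x and y. Then, we have

$$\mathbb{E}_{d_k}[d_k^{\alpha}] = R_{\min}^{\alpha}\,{}_2F_1\left(k, -\frac{\alpha}{2}; N+1; 1-\frac{R_{\max}^2}{R_{\min}^2}\right) \tag{22}$$

and

$$\mathbb{E}_{d_k}\left[\log_2\left(1+\overline{\mathrm{SINR}}_k\right)\right] = \int_{R_{\min}}^{R_{\max}}\log_2\left(1+\frac{1}{T_1 r_k^{\alpha} + T_2 r_k^{-\alpha} + T_3}\right) f_{d_k}^{\mathrm{LUS}}(r_k)\,dr_k, \tag{23}$$

where ${}_2F_1(\cdot)$ is the ordinary hypergeometric function [30].
Therefore, $\bar{\mathcal{R}}_{\mathrm{LUS}}$ can be formulated with the aid of (12), (14),
(21), (22) and (23), as shown in (24) on the next page.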

If the system parameters are fixed, we are capable of solving
for the optimal number $K^*_{\mathrm{LUS}}$ maximizing $\bar{\mathcal{R}}_{\mathrm{LUS}}$ offline relying
on standard line search algorithms, i.e.,

$$K^*_{\mathrm{LUS}} = \arg\max_{K \in \{1,\ldots,M-1\}} \Gamma_2(T, M, K, R_{\min}, R_{\max}, P, c, \sigma_n^2, p, \alpha, \delta). \tag{25}$$

Once $K^*_{\mathrm{LUS}}$ is found, we just need to sort UEs in ascending
order of their distances from the BS and select the first $K^*_{\mathrm{LUS}}$
UEs for data transmission.
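The online part of both schemes reduces to a few lines once K* has been computed offline. The sketch below is one possible realization, assuming the BS holds a list of candidate-UE indices and their distances; the function names `select_rus` and `select_lus` are hypothetical.

```python
import random


def select_rus(num_candidates, k_star, rng):
    """K*-RUS: draw k_star distinct candidate-UE indices uniformly at random."""
    return rng.sample(range(num_candidates), k_star)


def select_lus(distances, k_star):
    """K*-LUS: indices of the k_star UEs nearest to the BS, nearest first."""
    order = sorted(range(len(distances)), key=lambda i: distances[i])
    return order[:k_star]


# Four candidate-UEs at the given BS-UE distances (metres), K* = 2.
distances = [120.0, 40.0, 200.0, 75.0]
active_lus = select_lus(distances, 2)
active_rus = select_rus(len(distances), 2, random.Random(1))
```

Neither function touches the small-scale fading coefficients, so only the selected UEs need to transmit pilots afterwards.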

Remark 3. With respect to the number of active-UEs K,
we determine $K = K^*_{\mathrm{RUS}}$ and $K = K^*_{\mathrm{LUS}}$ according to the
given system parameters in the proposed K*-RUS and
K*-LUS schemes, respectively. By contrast, there usually exists
the scenario of K = M, where the full-spatial-multiplexing
transmission may be performed in the conventional RUS and
LUS schemes. In SUS, the value of K depends on specific
channel realizations and will not be obtained until the selection
procedure is completed (please see Section IV-B and Section
VI-A in [?] for more details).
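$$\bar{\mathcal{R}}_{\mathrm{RUS}} = \Gamma_1(T, M, K, R_{\min}, R_{\max}, P, c, \sigma_n^2, p, \alpha, \delta) = \left(1-\frac{K}{T}\right)K\int_{R_{\min}}^{R_{\max}}\frac{2r}{R_{\max}^2 - R_{\min}^2}\log_2\left(1+\frac{1}{A(K,M)\left(r^{\alpha} + \frac{2(K-1)(R_{\max}^{\alpha+2} - R_{\min}^{\alpha+2})}{(\alpha+2)(R_{\max}^2 - R_{\min}^2)}\right) + B(K,M)\left(1 + r^{-\alpha}\frac{2(K-1)(R_{\max}^{\alpha+2} - R_{\min}^{\alpha+2})}{(\alpha+2)(R_{\max}^2 - R_{\min}^2)}\right)}\right)dr. \tag{19}$$

$$\bar{\mathcal{R}}_{\mathrm{LUS}} = \Gamma_2(T, M, K, R_{\min}, R_{\max}, P, c, \sigma_n^2, p, \alpha, \delta) = \left(1-\frac{K}{T}\right)\sum_{k=1}^{K}\int_{R_{\min}}^{R_{\max}}\frac{2 r_k (r_k^2 - R_{\min}^2)^{k-1}(R_{\max}^2 - r_k^2)^{N-k}}{B(k, N-k+1)(R_{\max}^2 - R_{\min}^2)^{N}}\log_2\left(1+\frac{1}{A(K,M)\left(r_k^{\alpha} + S_k\right) + B(K,M)\left(1 + r_k^{-\alpha} S_k\right)}\right)dr_k, \quad S_k = \sum_{i=1,i\neq k}^{K} R_{\min}^{\alpha}\,{}_2F_1\left(i, -\frac{\alpha}{2}; N+1; 1-\frac{R_{\max}^2}{R_{\min}^2}\right). \tag{24}$$

In (19) and (24), A(K, M) and B(K, M) are the coefficients defined
in (12), while B(k, N-k+1) denotes the Beta function as in (21).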

D. Computational Complexity Analysis 

For a system having M BS antennas and N candidate-UEs,
although the user selection relying on exhaustive
search achieves the best sum rate performance, approximately
Xlfcli (^)& 5 TU complex-valued operations are required to 
complete one selection [?], which may be unaffordable in
practice. For SUS, the computational complexity is roughly
O(M^3 N) [?], which is high for large-M systems. In stark
contrast to these conventional schemes, the online computational
complexity of the proposed K*-RUS and K*-LUS is
independent of M and N, and the instantaneous CSI-based
complicated online computations are avoided. As a result, the
computational complexity is approximately O(1), which is
the same as that of the conventional RUS scheme.


E. Performance Analysis for the Special Case of p = 0, δ = 0

All the above investigations are subject to the general case,
i.e., in the context of the systems with imperfect CSI and
transmit antenna correlation. In this subsection, we consider a
special case in which there exists neither channel estimation
error nor transmit antenna correlation, i.e., p = 0, δ = 0. In
this context, we can obtain clearer insights into how
the system performance is affected by different user selection
schemes.

For p = 0, δ = 0, we have

$$A(K,M) = \frac{\sigma_n^2}{Pc(M-K)}, \quad B(K,M) = 0.$$

Substituting them into (19) and (24), the approximate sum
rates can be calculated. Nevertheless, the integrals of logarithmic
functions in (19) and (24) degrade the intelligibility of the
results.


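Here, a new method, which is different from those adopted
in Section III-B and Section III-C, is developed for finding a
much simpler expression of the ergodic sum rate approximation
in this special case. In what follows we apply Jensen's
inequality in a slightly different manner for the sake of
finding a more concise expression of the system sum rate
approximation. More specifically, we have

$$\mathbb{E}_{d_1,\ldots,d_K}\left[\sum_{k=1}^{K}\log_2(1+\mathrm{SINR}_k)\right] \ge K\log_2\left(1+\overline{\mathrm{SINR}}\right),$$

where

$$\overline{\mathrm{SINR}} = \frac{Pc(M-K)}{\sigma_n^2\sum_{k=1}^{K}\mathbb{E}_{d_k}[d_k^{\alpha}]}.$$

The approximate sum rate $\bar{\mathcal{R}}$ for K*-RUS is then given by

$$\bar{\mathcal{R}}_{\mathrm{RUS}} = \left(1-\frac{K^*_{\mathrm{RUS}}}{T}\right)K^*_{\mathrm{RUS}}\log_2\left(1+\frac{Pc(M-K^*_{\mathrm{RUS}})(\alpha+2)\left(R_{\max}^2 - R_{\min}^2\right)}{2\sigma_n^2 K^*_{\mathrm{RUS}}\left(R_{\max}^{\alpha+2} - R_{\min}^{\alpha+2}\right)}\right). \tag{26}$$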
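For the special case p = 0, δ = 0, the offline search for the number of active-UEs under random selection can be sketched directly from the closed form (26). The example below is a minimal illustration: the function names are hypothetical, powers are converted from dBm to milliwatts, and the parameter values are those quoted in the simulation section (T = 196, M = 32, α = 3.76, c = 10^-3.53, noise power -96 dBm, radii 35 m and 250 m).

```python
import math


def dbm_to_mw(value_dbm):
    """Convert a power level in dBm to milliwatts."""
    return 10 ** (value_dbm / 10)


def approx_rate_rus(K, T, M, P, c, noise, alpha, r_min, r_max):
    """Closed-form approximation (26), evaluated for a given number of active-UEs K."""
    mean_d_alpha = (2 * (r_max ** (alpha + 2) - r_min ** (alpha + 2))
                    / ((alpha + 2) * (r_max ** 2 - r_min ** 2)))
    sinr = P * c * (M - K) / (noise * K * mean_d_alpha)
    return (1 - K / T) * K * math.log2(1 + sinr)


def k_star_rus(T, M, P, c, noise, alpha, r_min, r_max):
    """One-dimensional search over K in {1, ..., M-1}, as in (20)."""
    return max(range(1, M),
               key=lambda K: approx_rate_rus(K, T, M, P, c, noise, alpha, r_min, r_max))


params = dict(T=196, M=32, c=10 ** -3.53, noise=dbm_to_mw(-96),
              alpha=3.76, r_min=35.0, r_max=250.0)
k_opt = k_star_rus(P=dbm_to_mw(30), **params)
```

Since the search covers at most M-1 candidates and uses no channel realization, it can be run once per parameter set and its result stored.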


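For K*-LUS, the sum rate is approximated as

$$\bar{\mathcal{R}}_{\mathrm{LUS}} = \left(1-\frac{K^*_{\mathrm{LUS}}}{T}\right)K^*_{\mathrm{LUS}}\log_2\left(1+\frac{Pc(M-K^*_{\mathrm{LUS}})}{\sigma_n^2\sum_{k=1}^{K^*_{\mathrm{LUS}}} R_{\min}^{\alpha}\,{}_2F_1\left(k, -\frac{\alpha}{2}; N+1; 1-\frac{R_{\max}^2}{R_{\min}^2}\right)}\right). \tag{27}$$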
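The order-statistic moments that enter (27) can also be obtained without evaluating the hypergeometric function. The sketch below deliberately swaps in a different technique: it uses the fact that F_r(d_k) for the k-th nearest of N i.u.d. UEs follows a Beta(k, N-k+1) distribution and integrates numerically with a midpoint rule. The function name and step count are assumptions made for illustration.

```python
import math


def mean_d_alpha_kth_nearest(k, n, alpha, r_min, r_max, steps=4000):
    """E[d_k^alpha] for the k-th nearest of n UEs via midpoint integration.

    Uses u = F_r(d_k) ~ Beta(k, n-k+1) and d_k^2 = r_min^2 + u * (r_max^2 - r_min^2).
    """
    # log of 1 / B(k, n-k+1) = Gamma(n+1) / (Gamma(k) * Gamma(n-k+1))
    log_norm = math.lgamma(n + 1) - math.lgamma(k) - math.lgamma(n - k + 1)
    span = r_max ** 2 - r_min ** 2
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * h
        log_pdf = log_norm + (k - 1) * math.log(u) + (n - k) * math.log(1 - u)
        total += math.exp(log_pdf) * (r_min ** 2 + u * span) ** (alpha / 2)
    return total * h


n_ues, alpha, r_min, r_max = 16, 3.76, 35.0, 250.0
moments = [mean_d_alpha_kth_nearest(k, n_ues, alpha, r_min, r_max)
           for k in range(1, n_ues + 1)]
# Per-UE moment under purely random selection, from the closed form in (17).
rus_moment = (2 * (r_max ** (alpha + 2) - r_min ** (alpha + 2))
              / ((alpha + 2) * (r_max ** 2 - r_min ** 2)))
```

Summing the moments over all N order statistics recovers N times the random-selection moment, which gives a convenient consistency check; truncating the sum to the K nearest UEs yields the smaller denominator that makes LUS outperform RUS in (27).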


Compared to (19) and (24), there is no integral calculation of
logarithmic functions in (26) and (27), which simplifies the
system sum rate expression. Furthermore, as shown in Fig. 3,
(26) and (27) are capable of providing tight approximations in
the case of p = 0, δ = 0.

According to (26) and (27), it is clear that both of the sum
rates $\bar{\mathcal{R}}_{\mathrm{RUS}}$ and $\bar{\mathcal{R}}_{\mathrm{LUS}}$ increase when P and M become larger.
Additionally, when N increases, the sum rate of K*-LUS
increases, while the sum rate of K*-RUS remains unchanged.

As far as SUS is concerned in the TDD scenario, as proved
in [?], the ergodic sum rate is upper bounded by $\bar{\mathcal{R}}_{\mathrm{SUS}}$, which
is given by

$$\bar{\mathcal{R}}_{\mathrm{SUS}} = \left[1-\frac{N}{T}\right]^{+} M\,\mathcal{O}(\log_2\log_2 N), \tag{28}$$

where $[\cdot]^{+}$ is defined as $[x]^{+} = \max\{x, 0\}$. Due to the pilot
overhead imposed by the channel estimation, the system sum
rate is scaled by the factor $[1 - N/T]^{+}$. More specifically, in
this case the instantaneous CSI of all the N candidate-UEs
is required for selecting active-UEs at each coherence slot.

Comparing the pre-log parts $[1 - N/T]^{+}$ in (28),
$(1 - K^*_{\mathrm{RUS}}/T)$ in (26) and $(1 - K^*_{\mathrm{LUS}}/T)$ in (27), we can see
that the advantages of K*-RUS and K*-LUS are obvious
for the systems relying on pilot-based channel estimation. In
particular, when T is not significantly larger than N, with
regard to SUS, a large portion of the coherence slot would be
dedicated to channel estimation, which reduces the resources
for the downlink data delivery. By contrast, with the proposed
user selection schemes, we only have to estimate the CSI of
the $K^*_{\mathrm{RUS}}$ or $K^*_{\mathrm{LUS}}$ active-UEs for precoding. Both $K^*_{\mathrm{RUS}}$ and
$K^*_{\mathrm{LUS}}$ are usually much smaller than N, hence our schemes
are superior to SUS by exploiting more data transmission
resources.
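The difference between the pre-log factors is simple arithmetic, illustrated by the short sketch below. T = 196 and N = 64 are the values used in the simulation section, while the number of active-UEs (16) is a hypothetical value chosen only for illustration; the function names are likewise assumptions.

```python
def prelog_sus(n_candidates, T):
    """Pre-log factor [1 - N/T]^+ when all N candidate-UEs must send pilots."""
    return max(1 - n_candidates / T, 0.0)


def prelog_k_star(k_active, T):
    """Pre-log factor (1 - K/T) when only the K active-UEs send pilots."""
    return 1 - k_active / T


# With T = 196: training all 64 candidates leaves about 67% of the slot for
# data, whereas training 16 active-UEs leaves about 92%.
share_sus = prelog_sus(64, 196)
share_proposed = prelog_k_star(16, 196)
```

When N approaches or exceeds T, the first factor collapses to zero, while the second depends only on the number of UEs actually served.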

Remark 4. The result of $\mathcal{O}(\log_2\log_2 N)$ in (28) is obtained
when we have N → ∞ and a fixed M [?]. Moreover, as shown
in [32], when the number of candidate-UEs N is linearly
related to the number of BS antennas M (i.e., the case we have
discussed in this paper), only marginal multiuser diversity gain
might be achieved by SUS.

IV. Numerical Simulations and Discussions 

In this section we present simulation results to show the
benefits of the proposed user selection schemes. The cellular
model employed is based on that of [16] and [33]. Like [10],
we assume the number of symbols in a coherence slot is 196,
over which the channel is constant, i.e., T = 196. The cell
radius is $R_{\max}$ = 250 m and the minimum distance is $R_{\min}$ =
35 m. The pathloss exponent is α = 3.76 and the reference
LSF factor is c = 10^{-3.53}. The total noise power is assumed
to be $\sigma_n^2$ = -96 dBm.

A. Sum Rate Performance 

In Fig. 3, the sum rate performance of K-RUS and K-LUS
as a function of K is evaluated in the case of P = 30
dBm, M = 32 and N = 64. The simulated ergodic sum rates
(marked as 'Sim.' in the figure) are obtained by averaging over
10000 independent channel realizations (both SSF and LSF
CSI are regenerated at each realization). It is observed that
there exist $K^*_{\mathrm{RUS}}$ and $K^*_{\mathrm{LUS}}$ which maximize the system sum
rate for K-RUS and K-LUS, respectively. Moreover, we have
$K^*_{\mathrm{RUS}} < M$ and $K^*_{\mathrm{LUS}} < M$ for the simulation parameters
considered. In Fig. 3 we also show the approximate sum rates
given by (19) and (24), which are very tight. Therefore, it is
reasonable to design user selection schemes based on them.

Fig. 3. Sum rates of K-RUS and K-LUS. P = 30 dBm, M = 32, N = 64.

Fig. 4. Contour figures of $K^*_{\mathrm{RUS}}$, $K^*_{\mathrm{LUS}}$, $\bar{\mathcal{R}}_{\mathrm{RUS}}$ and $\bar{\mathcal{R}}_{\mathrm{LUS}}$ against P and N.
p = 0, δ = 0, M = 32. $K^*_{\mathrm{RUS}}$ and $K^*_{\mathrm{LUS}}$ are shown in (a) (i.e., the top-left
sub-figure) and (b) (i.e., the top-right sub-figure) with solid lines, respectively.
$\bar{\mathcal{R}}_{\mathrm{RUS}}$ and $\bar{\mathcal{R}}_{\mathrm{LUS}}$ are shown in (c) (i.e., the bottom-left sub-figure) and (d) (i.e.,
the bottom-right sub-figure) with dashed lines, respectively.

In order to provide intuitive insights into how $K^*_{\mathrm{RUS}}$ and
$K^*_{\mathrm{LUS}}$ are affected by P and N, we evaluate $K^*_{\mathrm{RUS}}$ and $K^*_{\mathrm{LUS}}$
with no transmit correlation and perfect CSI estimation (i.e.,
p = 0, δ = 0) in Fig. 4(a) and (b), respectively. It is observed
that both $K^*_{\mathrm{RUS}}$ and $K^*_{\mathrm{LUS}}$ increase with the transmit power
P. On the other hand, when we increase N, $K^*_{\mathrm{LUS}}$ increases
and $K^*_{\mathrm{RUS}}$ remains unchanged. This can be easily explained
from the perspective of multiuser diversity gain. In particular,
$K^*_{\mathrm{LUS}}$ is related to N because we select UEs according to LSF
in K*-LUS. By contrast, $K^*_{\mathrm{RUS}}$ is independent of N because
random selection is adopted in K*-RUS. Moreover, Fig. 4(c)
and Fig. 4(d) provide the sum rates of K*-RUS and
K*-LUS. As expected, we can see that both $\bar{\mathcal{R}}_{\mathrm{RUS}}$ and $\bar{\mathcal{R}}_{\mathrm{LUS}}$ are
enhanced by the increase of P. When N increases, $\bar{\mathcal{R}}_{\mathrm{LUS}}$ rises
but $\bar{\mathcal{R}}_{\mathrm{RUS}}$ remains unchanged.

In Fig. 5, the impacts of the channel estimation accuracy ρ and the channel correlation factor δ on the system performance are characterized, where we set P = 30 dBm, M = 32 and N = 64. In sub-figures (a) and (b), the optimal numbers of active-UEs for K*-RUS and K*-LUS are shown, respectively. We can see that both $K^*_{\mathrm{RUS}}$ and $K^*_{\mathrm{LUS}}$ attain their maximum values at ρ = 0, δ = 0. This observation indicates that the BS should serve more UEs under uncorrelated channel scenarios with perfect CSI estimation than under correlated channel scenarios with imperfect CSI estimation. The sum rates of K*-RUS and K*-LUS are shown in sub-figures (c) and (d), respectively. It is clear that the sum rates decrease as δ and ρ increase.

Fig. 5. Contour figures of $K^*_{\mathrm{RUS}}$, $K^*_{\mathrm{LUS}}$, $\mathcal{R}^*_{\mathrm{RUS}}$ and $\mathcal{R}^*_{\mathrm{LUS}}$ against ρ and δ. P = 30 dBm, M = 32, N = 64. $K^*_{\mathrm{RUS}}$ and $K^*_{\mathrm{LUS}}$ are shown in (a) and (b) with solid lines, respectively. $\mathcal{R}^*_{\mathrm{RUS}}$ and $\mathcal{R}^*_{\mathrm{LUS}}$ are shown in (c) and (d) with dashed lines, respectively.

The sum rate performance of various user selection schemes, including K*-LUS, K*-RUS, SUS and RUS, is evaluated against the transmit power P with M = 32, N = 64 in Fig. 6. Four scenarios are investigated, i.e., uncorrelated channel with perfect CSI estimation (ρ = 0, δ = 0), correlated channel with perfect CSI estimation (ρ = 0, δ = 0.5), uncorrelated channel with imperfect CSI estimation (ρ = 0.1, δ = 0) and correlated channel with imperfect CSI estimation (ρ = 0.1, δ = 0.5). Note that for the conventional RUS, we randomly select M UEs for simultaneous data transmissions. For SUS, in order to ensure fair comparisons among all the four schemes, we adopt the equal power allocation instead of the water-filling allocation. Furthermore, the optimal value of $\alpha_{\mathrm{SUS}}$, which is an important parameter in SUS (denoted as α in the corresponding reference), is used for the SUS scheme in our simulation⁵. As expected, K*-LUS achieves the best sum rate performance among the four schemes, and compared with RUS, K*-RUS also achieves a significant sum rate improvement. In addition, due to the non-negligible channel estimation pilot overhead and the lack of multiuser diversity gain, the conventional SUS scheme achieves a sum rate performance that is similar to [e.g., in sub-figures (a) and (b)] or even worse than [e.g., in sub-figures (c) and (d)] that of the proposed K*-RUS, even though the online computational complexity of SUS is much higher than that of K*-RUS.

5 The sum rate performance of the SUS scheme is highly sensitive to the choice of $\alpha_{\mathrm{SUS}}$. The optimal value of $\alpha_{\mathrm{SUS}}$ varies with M, N and P. By searching over the interval (0, 1], we obtain the optimal values of $\alpha_{\mathrm{SUS}}$ maximizing the sum rates of the SUS scheme for different configurations of M, N and P.

Fig. 6. Sum rate $\mathcal{R}$ vs. transmit power P. M = 32 and N = 64.

(a) ρ = 0, δ = 0. (b) ρ = 0, δ = 0.5. (c) ρ = 0.1, δ = 0. (d) ρ = 0.1, δ = 0.5.

Fig. 7. Sum rate $\mathcal{R}$ vs. the number of candidate-UEs N. P = 30 dBm and M = 32.
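The search over (0, 1] for the SUS threshold can be sketched as a plain grid search. The code below is an illustrative assumption rather than the simulation behind Fig. 6: it implements the usual semi-orthogonal selection rule (pick the candidate with the largest component orthogonal to the users already chosen, then discard candidates whose normalized correlation with that component is not below α), uses unit-variance Rayleigh channels without LSF, ignores pilot overhead, and applies ZF with a total-power normalization. The function names and the grid are hypothetical.

```python
import numpy as np

def sus_select(H, alpha, max_users):
    """Semi-orthogonal user selection on the rows of H (one row per UE)."""
    candidates = list(range(H.shape[0]))
    selected, basis = [], []  # basis holds the mutually orthogonal components
    while candidates and len(selected) < max_users:
        best, best_g, best_norm = None, None, -1.0
        for k in candidates:
            # Component of h_k orthogonal to the users selected so far
            g = H[k].copy()
            for b in basis:
                g = g - (np.vdot(b, g) / np.vdot(b, b)) * b
            norm_g = np.linalg.norm(g)
            if norm_g > best_norm:
                best, best_g, best_norm = k, g, norm_g
        selected.append(best)
        basis.append(best_g)
        # Keep only candidates that are alpha-semi-orthogonal to the new component
        candidates = [k for k in candidates if k != best and
                      abs(np.vdot(best_g, H[k])) < alpha * np.linalg.norm(H[k]) * best_norm]
    return selected

def zf_sum_rate(H_sel, snr):
    """ZF sum rate with total-power normalization (same SNR for every served UE)."""
    K = H_sel.shape[0]
    inv_gram = np.linalg.inv(H_sel @ H_sel.conj().T)
    return K * np.log2(1.0 + snr / np.real(np.trace(inv_gram)))

def search_alpha(alphas, channels, M, snr):
    """Return the grid point with the largest average sum rate, and all averages."""
    avg = np.array([np.mean([zf_sum_rate(H[sus_select(H, a, M)], snr)
                             for H in channels]) for a in alphas])
    return float(alphas[int(np.argmax(avg))]), avg

rng = np.random.default_rng(0)
M, N = 8, 16
channels = [(rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)
            for _ in range(30)]
alphas = np.linspace(0.1, 1.0, 10)
best_alpha, avg_rates = search_alpha(alphas, channels, M, snr=100.0)
print("best alpha on the grid:", best_alpha)
```

The same channel realizations are reused for every grid point so that differences between thresholds are not masked by Monte Carlo noise.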

Furthermore, in Fig. 7, we investigate the sum rates of the four schemes for different numbers of candidate-UEs, N, when P = 30 dBm and M = 32. It is clear that with the increase of N, the sum rates of K*-RUS and RUS remain unchanged. In contrast, K*-LUS obtains sum rate improvements as N increases because more multiuser diversity related to LSF can be exploited with larger N. For SUS, the sum rate decreases with N, because the pilot overhead becomes significant for large N and outweighs the increase of the multiuser diversity gain.


B. Fairness Performance 

In this subsection, we evaluate the system performance in terms of long-term fairness among UEs. Jain's Fairness Index (JFI) [34] $\mathcal{J}$, defined as

$$\mathcal{J} = \frac{\left(\sum_{n=1}^{N} \omega_n \mathcal{R}_n\right)^2}{N \sum_{n=1}^{N} \left(\omega_n \mathcal{R}_n\right)^2},$$

is employed in our investigation, where $\mathcal{R}_n$ is the rate of the n-th candidate-UE defined earlier and $\omega_n$ is the probability of the n-th candidate-UE being selected to be served at each coherence slot.

(a) ρ = 0, δ = 0. (b) ρ = 0, δ = 0.5. (c) ρ = 0.1, δ = 0. (d) ρ = 0.1, δ = 0.5.

Fig. 8. Fairness performance $\mathcal{J}$ vs. transmit power P. M = 32 and N = 64.

Fig. 9. Fairness performance $\mathcal{J}$ vs. the number of candidate-UEs N. P = 30 dBm and M = 32.

First, we briefly analyze the long-term fairness performance of K*-RUS and K*-LUS without considering the channel estimation error and transmit correlation. Then the simulation results are shown in Fig. 8 and Fig. 9.

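In the case of ρ = 0, δ = 0, for the n-th candidate-UE in K*-RUS, we have $\omega_n = K^*_{\mathrm{RUS}}/N$ and $\mathcal{R}_n = \mathcal{R}^*_{\mathrm{RUS}}/K^*_{\mathrm{RUS}}$. Therefore, the JFI of K*-RUS is given by

$$\mathcal{J}_{\mathrm{RUS}} = \frac{\left(\sum_{n=1}^{N} \frac{K^*_{\mathrm{RUS}}}{N}\,\frac{\mathcal{R}^*_{\mathrm{RUS}}}{K^*_{\mathrm{RUS}}}\right)^2}{N \sum_{n=1}^{N} \left(\frac{K^*_{\mathrm{RUS}}}{N}\,\frac{\mathcal{R}^*_{\mathrm{RUS}}}{K^*_{\mathrm{RUS}}}\right)^2} = 1. \tag{29}$$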


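It is clear that K*-RUS is capable of offering the optimal fairness among candidate-UEs. For K*-LUS, the $K^*_{\mathrm{LUS}}$ candidate-UEs nearest to the BS are always active, whereas the other UEs far from the BS have little chance to be served. Thus, for the candidate-UE which is the i-th nearest to the BS, we have $\omega_i = 1$, $\mathcal{R}_i = \mathcal{R}^*_{\mathrm{LUS}}/K^*_{\mathrm{LUS}}$ for $1 \le i \le K^*_{\mathrm{LUS}}$, and $\omega_i = 0$, $\mathcal{R}_i = 0$ for $K^*_{\mathrm{LUS}} < i \le N$. The JFI of K*-LUS is then calculated as

$$\mathcal{J}_{\mathrm{LUS}} = \frac{\left(\sum_{i=1}^{K^*_{\mathrm{LUS}}} \frac{\mathcal{R}^*_{\mathrm{LUS}}}{K^*_{\mathrm{LUS}}}\right)^2}{N \sum_{i=1}^{K^*_{\mathrm{LUS}}} \left(\frac{\mathcal{R}^*_{\mathrm{LUS}}}{K^*_{\mathrm{LUS}}}\right)^2} = \frac{K^*_{\mathrm{LUS}}}{N}. \tag{30}$$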


Usually, we have $\mathcal{J}_{\mathrm{LUS}} < 1$ because $K^*_{\mathrm{LUS}} < N$. Moreover, we can see that $\mathcal{J}_{\mathrm{LUS}}$ increases when $K^*_{\mathrm{LUS}}$ becomes higher.
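The two closed-form values can be checked numerically from the JFI definition. The snippet below is a minimal sketch with hypothetical function names and example numbers. It builds the selection probabilities and per-UE rates described above for the idealized case ρ = 0, δ = 0 and evaluates the index directly.

```python
import numpy as np

def jain_index(selection_prob, rate):
    """Jain's fairness index of the long-term rates omega_n * R_n."""
    x = np.asarray(selection_prob, dtype=float) * np.asarray(rate, dtype=float)
    return x.sum() ** 2 / (x.size * np.sum(x ** 2))

def jfi_rus(N, K, sum_rate):
    """Random selection: every UE is served with probability K/N at rate sum_rate/K."""
    return jain_index(np.full(N, K / N), np.full(N, sum_rate / K))

def jfi_lus(N, K, sum_rate):
    """Location-based selection: only the K nearest UEs are ever served."""
    prob = np.zeros(N)
    rate = np.zeros(N)
    prob[:K] = 1.0
    rate[:K] = sum_rate / K
    return jain_index(prob, rate)

# Example values (hypothetical): N = 64 candidates, K = 16 served UEs
print(jfi_rus(64, 16, 100.0))  # equal long-term rates for all UEs
print(jfi_lus(64, 16, 100.0))  # equals K / N for this idealized model
```

The index depends only on the ratio K/N in the location-based case, so the sum rate value passed in does not affect the result.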

In Fig. 8, we show the fairness among UEs against P in the context of various schemes with M = 32, N = 64. In this simulation, we assume that the LSF CSI of each candidate-UE remains unchanged for 100T and the SSF CSI changes for each T. The window length evaluated for JFI is also assumed to be 100T for creating the worst scenario in terms of fairness. We can see that K*-RUS provides good fairness among UEs, and K*-LUS exhibits poor fairness performance. With perfect channel estimation, the JFI of K*-RUS remains 1 for various P, which is consistent with (29). Furthermore, for K*-LUS, the JFI increases as P increases. This is because with the increase of P, more UEs can be served at the same time while N remains unchanged, which coincides with (30).


Note that we also give the simulation results concerning the fairness performance with imperfect channel estimation. In this context, the JFI of K*-RUS decreases as P increases. This is because the SINRs of active-UEs are different from each other due to the existence of the second part of the denominator of the SINR expression (i.e., the part related to the channel estimation error). Moreover, the SINR difference increases when P rises. As a result, the fairness performance decreases.

Finally, the fairness performance against N for M = 32, P = 30 dBm is shown in Fig. 9. It can be observed that K*-RUS is capable of achieving good fairness performance, whereas K*-LUS has poor fairness performance. Moreover, increasing N degrades the fairness performance of K*-LUS, because the proportion of active-UEs among candidate-UEs declines although $K^*_{\mathrm{LUS}}$ increases as N becomes larger.


V. Conclusions 

Considering the requirements of high energy efficiency and massive device connectivity in future 5G communication systems, we have proposed a pair of low-complexity user selection methods for downlink massive MIMO systems in this paper. Taking the randomness of both the channel matrix and UE locations into consideration, we have obtained the approximations of the ergodic sum rates for the multiuser massive MIMO systems. By exploiting these approximations, K*-RUS and K*-LUS are developed, which are capable of significantly enhancing the system sum rate performance. Since no online operations related to SSF CSI are required in the proposed user selection algorithms, the computational complexity of the proposed schemes is extremely low. Besides the sum rate improvements, we also investigated the fairness among UEs and showed the remarkable fairness performance advantages of the proposed K*-RUS scheme. In the future, we will investigate low-complexity user selection methods in multi-cell scenarios.


Appendix 
Detailed Derivation of (11)

With © and ©, we have 

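$$\gamma^2 = \frac{(1-\rho)P}{\operatorname{tr}\left(\mathbf{D}^{1/2}\mathbf{Z}_1\mathbf{R}\mathbf{Z}_1^H\mathbf{D}^{1/2}\right)^{-1}}. \tag{31}$$

According to the results in [24, Appendix III], we obtain

$$\operatorname{tr}\left(\mathbf{D}^{1/2}\mathbf{Z}_1\mathbf{R}\mathbf{Z}_1^H\mathbf{D}^{1/2}\right)^{-1} \xrightarrow{\text{a.s.}} \frac{1}{M\phi}\operatorname{tr}\left(\mathbf{D}^{-1}\right),$$

where $\phi$ is the unique solution of

$$\phi = \frac{1}{M}\operatorname{tr}\mathbf{R}\left(\mathbf{I}_M + \frac{K}{M}\frac{1}{\phi}\mathbf{R}\right)^{-1}. \tag{32}$$

Hence, the deterministic value of $\gamma^2$ satisfies

$$\gamma^2 \xrightarrow{\text{a.s.}} \frac{(1-\rho)Pc\phi M}{\sum_{k=1}^{K} a_k}. \tag{33}$$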
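The fixed-point equation (32) has no closed form for a general correlation matrix, but it is easy to solve numerically. The following sketch assumes a simple fixed-point iteration on the eigenvalues of R and, purely for illustration, an exponential correlation model with entries δ^|i-j|. Both the model and the function names are assumptions, not taken from the paper. A small Monte Carlo routine compares the average diagonal entry of (Z R Z^H)^(-1) with 1/(Mφ).

```python
import numpy as np

def exp_correlation(M, delta):
    """Assumed exponential correlation model: R[i, j] = delta ** |i - j|."""
    idx = np.arange(M)
    return delta ** np.abs(idx[:, None] - idx[None, :])

def solve_phi(R, K, tol=1e-12, max_iter=10_000):
    """Solve phi = (1/M) tr R (I + (K/M) R / phi)^(-1) by fixed-point iteration."""
    M = R.shape[0]
    lam = np.linalg.eigvalsh(R)   # R is Hermitian, so its eigenvalues are real
    beta = K / M
    phi = lam.mean()              # tr(R)/M upper-bounds the solution; start there
    for _ in range(max_iter):
        # In the eigenbasis of R the trace is a sum over eigenvalues
        new_phi = np.mean(lam * phi / (phi + beta * lam))
        if abs(new_phi - phi) < tol:
            return new_phi
        phi = new_phi
    raise RuntimeError("fixed-point iteration did not converge")

def mc_inverse_diag(R, K, trials, rng):
    """Monte Carlo average of the diagonal entries of (Z R Z^H)^(-1)."""
    M = R.shape[0]
    acc = 0.0
    for _ in range(trials):
        Z = (rng.standard_normal((K, M)) + 1j * rng.standard_normal((K, M))) / np.sqrt(2)
        acc += np.real(np.trace(np.linalg.inv(Z @ R @ Z.conj().T))) / K
    return acc / trials

M, K = 64, 16
R = exp_correlation(M, 0.5)
phi = solve_phi(R, K)
rng = np.random.default_rng(0)
print("phi =", phi)
print("Monte Carlo * M * phi =", mc_inverse_diag(R, K, 200, rng) * M * phi)
```

For R equal to the identity the iteration returns 1 - K/M, which is the familiar ZF array-gain factor. For correlated R the printed product should be close to one when M and K are large.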


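Then, we evaluate the second part of the denominator in the SINR expression. Since the entries of $\mathbf{h}_k$ are independent of $\mathbf{G}$, according to [25, Theorem 3.4], we have

$$\mathbf{h}_k\mathbf{R}^{1/2}\mathbf{G}^{\dagger}\mathbf{s}\mathbf{s}^H\left(\mathbf{G}^{\dagger}\right)^H\left(\mathbf{R}^{1/2}\right)^H\mathbf{h}_k^H \xrightarrow{\text{a.s.}} \operatorname{tr}\left(\mathbf{R}\mathbf{G}^H\left(\mathbf{G}\mathbf{G}^H\right)^{-2}\mathbf{G}\right). \tag{34}$$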

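Applying [25, Theorem 14.3], we can obtain

$$\operatorname{tr}\left(\mathbf{R}\mathbf{G}^H\left(\mathbf{G}\mathbf{G}^H\right)^{-2}\mathbf{G}\right) \xrightarrow{\text{a.s.}} \frac{1}{1-\rho}\,\frac{\psi}{M\phi^2 - K\psi}\operatorname{tr}\left(\mathbf{D}^{-1}\right),$$

where $\phi$ is given by (32) and $\psi$ is defined as

$$\psi = \frac{1}{M}\operatorname{tr}\mathbf{R}^2\left(\mathbf{I}_M + \frac{K}{M}\frac{1}{\phi}\mathbf{R}\right)^{-2}.$$

Thus, for (34), we obtain

$$\mathbf{h}_k\mathbf{R}^{1/2}\mathbf{G}^{\dagger}\mathbf{s}\mathbf{s}^H\left(\mathbf{G}^{\dagger}\right)^H\left(\mathbf{R}^{1/2}\right)^H\mathbf{h}_k^H \xrightarrow{\text{a.s.}} \frac{\psi\sum_{i=1}^{K} a_i}{(1-\rho)\left(Mc\phi^2 - \psi cK\right)}. \tag{35}$$

Therefore, substituting (33) and (35) into the SINR expression, we obtain (11).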
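The deterministic equivalent of the trace term can also be checked by simulation. The sketch below assumes that the estimated channel has the form G = sqrt(1 - ρ) D^(1/2) Z R^(1/2) with i.i.d. unit-variance complex Gaussian Z, which is the structure suggested by (31), and it again uses an exponential correlation model as a stand-in. These modelling choices, the function names, and the test values are illustrative assumptions. The routine compares a Monte Carlo average of tr(R G^H (G G^H)^(-2) G) with ψ tr(D^(-1)) / ((1 - ρ)(M φ² - K ψ)).

```python
import numpy as np

def phi_psi(R, K, tol=1e-12, max_iter=10_000):
    """Return (phi, psi) for a Hermitian correlation matrix R and K served UEs."""
    M = R.shape[0]
    lam = np.linalg.eigvalsh(R)
    beta = K / M
    phi = lam.mean()
    for _ in range(max_iter):
        new_phi = np.mean(lam * phi / (phi + beta * lam))
        converged = abs(new_phi - phi) < tol
        phi = new_phi
        if converged:
            break
    # Eigenvalues of R (I + (K/M) R / phi)^(-1), squared and averaged, give psi
    psi = np.mean((lam * phi / (phi + beta * lam)) ** 2)
    return phi, psi

def trace_limit(R, d, rho):
    """Deterministic equivalent of tr(R G^H (G G^H)^(-2) G); d is the diagonal of D."""
    M, K = R.shape[0], d.size
    phi, psi = phi_psi(R, K)
    return psi * np.sum(1.0 / d) / ((1.0 - rho) * (M * phi**2 - K * psi))

def trace_monte_carlo(R, d, rho, trials, rng):
    """Monte Carlo average of the same trace under the assumed channel model."""
    M, K = R.shape[0], d.size
    w, V = np.linalg.eigh(R)
    R_half = (V * np.sqrt(np.clip(w, 0.0, None))) @ V.conj().T  # Hermitian square root
    acc = 0.0
    for _ in range(trials):
        Z = (rng.standard_normal((K, M)) + 1j * rng.standard_normal((K, M))) / np.sqrt(2)
        G = np.sqrt(1.0 - rho) * np.sqrt(d)[:, None] * (Z @ R_half)
        inv_gram = np.linalg.inv(G @ G.conj().T)
        acc += np.real(np.trace(R @ G.conj().T @ inv_gram @ inv_gram @ G))
    return acc / trials

M, K, rho = 64, 16, 0.1
idx = np.arange(M)
R = 0.3 ** np.abs(idx[:, None] - idx[None, :])   # assumed exponential correlation
rng = np.random.default_rng(0)
d = rng.uniform(0.5, 2.0, K)                     # hypothetical LSF values
print("limit      :", trace_limit(R, d, rho))
print("Monte Carlo:", trace_monte_carlo(R, d, rho, 200, rng))
```

For R equal to the identity, ψ reduces to φ² and the limit becomes tr(D^(-1)) / ((1 - ρ)(M - K)), which matches the mean of an inverse Wishart matrix.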